Learning Human Utility from Video Demonstrations for Deductive Planning in Robotics
نویسندگان
چکیده
We uncouple three components of autonomous behavior (utilitarian value, causal reasoning, and fine motion control) to design an interpretable model of tasks from video demonstrations. Utilitarian value is learned from aggregating human preferences to understand the implicit goal of a task, explaining why an action sequence was performed. Causal reasoning is seeded from observations and grows from robot experiences to explain how to deductively accomplish subgoals. And lastly, fine motion control describes what actuators to move. In our experiments, a robot learns how to fold t-shirts from visual demonstrations, and proposes a plan (by answering why, how, and what) when folding never-beforeseen articles of clothing.
منابع مشابه
Tying Surgical Knots From Demonstration: Enhancing Demonstrations and Correcting Errors During Execution
Recent advances in the modeling of deformable objects such as surgical suture, rope, and hair show significant promise for improving the manipulation of such objects. An important application of these tasks lies in the area of medical robotics, where robotic surgical assistants have the potential to greatly reduce surgeon fatigue and human error by improving the accuracy, speed, and robustness ...
متن کاملEvaluation of Midwifery Student's Attitude, Performance and Satisfaction from teaching clinical skills with the Video in Hamedan School of Nursing and Midwifery (2019)
1. Duncan I, Yarwood-Ross L, Haigh C..YouTube as a source of clinical skills education. Nurse Eduction. .2013; 33 (12): 1576–1580 2. Arguel ., Jamet E. Using video and static pictures to improve learning of procedural contents.Comput. Hum. Behav.2008; 25 (2):354–359. 3. Johnson N, List-Ivankovic J, Eboh W, Ireland ., Adams D, Mowatt E, Martindale S. Research and evidence based pra...
متن کاملLeveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
We propose a general and model-free approach for Reinforcement Learning (RL) on real robotics with sparse rewards. We build upon the Deep Deterministic Policy Gradient (DDPG) algorithm to use demonstrations. Both demonstrations and actual interactions are used to fill a replay buffer and the sampling ratio between demonstrations and transitions is automatically tuned via a prioritized replay me...
متن کاملLearning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations
Dexterous multi-fingered hands are extremely versatile and provide a generic way to perform multiple tasks in human-centric environments. However, effectively controlling them remains challenging due to their high dimensionality and large number of potential contacts. Deep reinforcement learning (DRL) provides a model-agnostic approach to control complex dynamical systems, but has not been show...
متن کاملInverse Reinforcement Learning via Ranked and Failed Demonstrations
In many robotics applications, applying reinforcement learning (RL) can be especially difficult, as it depends on the prespecification of a reward function over the environment’s states, which is often hard to define. Inverse Reinforcement Learning (IRL) [1] attempts to address this problem, by utilizing human demonstrations to learn the reward function, without having a human explicitly define...
متن کامل